HODGERANK IS THE LIMIT OF PERRON RANK 



NGOC MAI TRAN 



(N 



Abstract. We study the map which takes an elemcntwise positive matrix to the k-th 
root of the principal eigenvector of its k-th Hadamard power. We show that as k tends to 
one recovers the row geometric mean vector and discuss the geometric significance of 
this convergence. In the context of pairwise comparison ranking, our result states that 
HodgcRank is the limit of Perron Rank, thereby providing a novel mathematical link 
■ between two important pairwise ranking methods. 

a: 

Let X = [Xij] be a n x n matrix, /C := R" xn be the open cone of elementwise positive 
matrices. For X G JC, let v(X),X(X) denote its unique principal eigenvector-eigenvalue 
£^ ; pair in PR™ -1 xR, X^ := [X k 3 ] be its k th Hadamard power. It is known that if the tropical 
max-times eigenvector m(X) of X is unique in PR™" 1 , then it is the coordinate-wise limit 
p-] ■ of the sequence v(X^) 1 ^ k as k — > oo [1]. 

lim v(X {k) ) 1/k = m(X) 

k—>oo 

Our first theorem states that the same sequence converges coordinate-wise to the row 
geometric mean of X as k — > 0. 
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Theorem 1. For X 6 K, define h{X)i = (JJXy) 1/n . Th 

(N '. i=i 
S : \imv(X {k) ) 1/k = h(X) 
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This result plays an important role in the context of pairwise ranking, where X is 
restricted to the subvariety of multiplicative comparison or symmetrically reciprocal ma- 
trices {X G /C : X^ = 1/Xji} [31 El El ES] • Here X^ measures the multiplicative preference 
of i over j. A ranking algorithm takes X as input and returns a score vector s E PR n_1 
by which the items are ranked. The triple v(X),m(X), h(X) are outputs of three ranking 
algorithms: Perron Rank [HI d7j, Tropical Rank [SI El and HodgeRank [21 [T2]. Perron 
Rank plays a fundamental role behind the Analytic Hierarchical Process [TH], a rank- 
ing procedure extensively applied in decision making. On the other hand, HodgeRank 
is closely related to many pairwise ranking algorithms in the rank learning literature of 
computer science [H EH E2] • A number of papers have been devoted to their comparisons 
Bfl El S E2l ESI ESI EH] . However, Theorem El is the first to show that HodgeRank can be 
obtained as the limit of Perron Rank, thus providing a mathematical tool for comparing 
pairwise ranking methods in computer science to those often used in decision making. 

Hadamard powers of a multiplicative comparison matrix arise naturally when one at- 
tempts to apply Perron Rank to additive comparison matrices, those which arise often in 
computer science applications [HI EDI Ell E2j • These are skew-symmetric matrices A2R™ 
where Ay measures the additive preference of i over j. They are in natural bijection with 
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multiplicative comparison matrices via the exponential/log map, however, the bijection 
is not unique since one is free to choose the base value. Changing the exponential base 
is equivalent to multiplying the additive matrix A by a positive constant, which can be 
interpreted as changing the measurement unit. This corresponds to taking Hadamard 
power of the mupliticative matrix X. Thus, if the input is an additive matrix A and a 
priori no base is preferred, one should consider all the Hadamard powers [exp(/cAjj)] when 
applying Perron Rank v : X H- v(X). This yields the Perron family of ranking methods 
Vk : X i—)- v(X^) 1 ' k for any k G (0, oo), with HodgeRank and Tropical Rank appear as 
limiting cases. We shall use the notation 14:^4 l/Hogt> ([exp(fca»j)]) to denote its log 
version. 

It can easily be checked that the Perron family, HodgeRank and Tropical Rank are 
projections onto the set of strongly transitive matrices [T3] or consistent [IB] matrices, 
which are rank one matrices identified with the score vectors in PM n_1 via w -H- [wi/wj]. 
Its image in A2ffi n under the log map is the subspace spanned by the matrices Y^=i e ij ~ 
Y^k=i e ki f° r i = 1) 2, . . . , n, where e^ is the matrix with 1 in the (i, j)-th entry and else, 
denoted ST- These matrices are so named since for all triples i, j, k the relation Xij = X^- 
Xkj holds, which is natural if one assumes that the comparison matrix is obtained from the 
score in the error-free. Thus one central question in pairwise ranking is the score recovery 
problem: suppose there is a true score w and one observes X, a perturbed version of the 
true comparison matrix [wi/wj]. Which method is 'best', in some sense, at recovering 
wl By specifying an objective function, such as number of items ranked correctly, one 
can convert the comparison of methods into an optimization problem. This formulation 
is commonly found in the broader area of rank learning EJ ED], a rich litearture on 
the study of algorithms for ranking with emphasis on large datasets, predicting the rank 
of new items and ranking the top items correctly (see, for example, [5] and references 
therein) . 

A large class of these optimization problems can be rephrased in A2M n as geometric 
questions on the images under the projection Vk of level sets of the noise distribution. 
The second result of our paper, Theorem [2J describes the fibers of this projection. It also 
contributes towards solving whether there exists an objective function in which Perron 
Rank for k ^ or oo is the optimal solution over all possible ranking algorithms. This 
question arised in the literature since HodgeRank is known to be the ^2-minimizer and 
Tropical Rank is known to be a special point on the set of ^oo-minimizers to ST [3 [T2] . 

Theorem 2. Let 1 = (1, . . . , 1) T be the all one vector, = (0, . . . , 0) T the all-zero vector. 
For all ssK/(l,...,l)) asa set, the fibers of the map Vk satisfy 




The zero-fiber V k (0) can be decomposed as 



V k 7 1 (0) = Q)S l (k)+R-ll T , 



i=l 
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where the Si(k) for i — 1, . . . , n are orthogonals, with 

n 

Si(k) := {a,i. G W l : < 0, V" exp(fc%) = 1} for k G (0, oo), 

3=1 

Si(oo) := {aj. G M n : < 0, = for at least one j = 1, . . . , n}, 



it 



#(0) := {a,6r:^ flij =0} 
i=i 

Since the fibers V^T (s) are just translations of the zero-fiber by [s, — sy], Theorem [2] 
implies that the optimal k* in the score recovery problem considered is independent of the 
true additive score s. For large k, the zero-fiber is the real amoeba of the tropical (max- 
plus) eigenvector map [13]. For k — 0, the zero-fiber is the set of matrices decomposing 
into the all-one matrix plus a matrix whose column sum is zero, which is the zero-fiber of 
the HodgeRank map. Thus we obtain a crude version of Theorem [TJ 

In summary, Theorem [T] and its geometric cousin, Theorem [2] are new mathemati- 
cal results linking HodgeRank and Perron Rank, two popular methods used in pairwise 
ranking. This converts a large class of the score recovery problem into a parametric op- 
timization question with geometric interpretations, contributing towards the solving of 
specific instances of this problem. 

I. Proofs 

1.1. Proof of Theorem [JJ. Equip W 1 with the Euclidean norm. Let || ■ || denote the 
operator norm if the argument is a matrix, and the £2 norm if the argument is a vector. Let 
I be the n x n identity matrix. Our proof of Theorem [1] relies on the following proposition, 
which gives a linear approximation to the 'error term' of the principal eigenvector when 
X is not far from being a strongly transitive matrix. 

Proposition 3. Fix a vector s G M+. Let £ := := [sj/s{ ■ Xij]. Suppose there 
exists a constant k > 1 such that the n x n matrix E = £ — nll T — (1 — k)I satisfies 

P ■= (^|b) 2 < \- Then 

v(X) = s ■ (1 + — (r - fl) + e) (1) 

KTl 

where r — (S ■ 1) is the row sum of S, f = - Y^™ 1 r, is rts mean, and llell < • -ML. 

This proposition is interesting in itself. One could think of s as the true score vector, 
[si/sj] as the true multiplicative comparison matrix, and £ as the multiplicative pertur- 
bation with centered version S, so defined since v(k11 t + (1 — k)I) = 1 for all k > 1. If 
H = then the perturbation does not affect the principal eigenvector, and v(X) is exactly 
s. The proposition states that if the centered perturbation H is small, as measured by p 
and J^j=, then u (X) differs from the true multiplicative score s only by a linear factor plus 
a lower order term. Unfortunately this proposition cannot be applied easily in practice 
since it requires knowledge of k, which depends on the choice of s and in general neither 
are easy to find. For the proof of Theorem [1] it is sufficient to choose k = 1. 

Proof of Proposition^ By [T9l Lemma 2.2], v(X) = s ■ v(£), hence one can assume 
without loss of generality that s = 1. Write £ = k11 t + + — k)I). Then k11 t is a rank 
one matrix with one non-zero eigenvalue pn, corresponding to the normalized eigenvector 



4 NGOC MAI TRAN 

^1. Let Y E R rax(n_1 ) be an orthonormal basis of the zero eigenspace of k11 t . From 
standard results in perturbed linear operators (see, for example, [T%| Theorem 2.7]), p < | 
implies v(£) = -^1 + YP + -7=, where P := w ^_ y T (5 + (1 — ft) 1)1, and the error term 
is bounded by 

^ ^ ~~ 1 — p nn 

Since YY T = I — ^11 T , and since is only defined up a multiplicative constant, we 
have 

v(o = -L U + Jl(i _ Iiit )(h + (i _ K) i)i + e 

= 1 + —(51 - -11 T S1) + e = 1 + — (r - f) + e 

□ 

Proof of TheoremQl Under the notation of Proposition [31 let k — 1, s = 1, so X — £. 
Define A = [Ay] = [logX^]. Then 

/ k t ~ 1 A t 
^ {k) - exp(kA t3 ) -l = k-[A ij + Y t 



i j 



t>2 



Hence for k close to 0, ||5 fc || = k\\A\\ + o(k) and = 0(k 2 ), and Proposition [3] applies. 

Note that r = S^l — k ■ Al + 0(k 2 ), f 1 = k ■ 11 T A1 + 0(k 2 ), and e = 0(k 3 ). After 
applying the logarithm to Equation (TJ Taylor expansion gives 

lim^logt;(X fc ) = lim ( ^-r - ^-f + 0(k 2 ) ] = -Al - -11 T A1 = - \ogh(X) 
fc-s-o k^o\kn kn J n n n 

since the HodgeRank vector is defined only up to additive constants. □ 

1.2. Proof of Theorem [21 It is sufficient to prove equivalent statements on K for the 
map Vfc. The fibers of this map are the set of positive matrices whose principal eigenvector 
is a fixed vector w G 1R" , that is 

}C{w):= K(w,fi):= {X elC:v{X) = w,X{X) = p} 

M6(0,oo) ^e(0,oo) 

where U denote disjoint set union. We call this the one- dimensional real positive Kalman 
variety, motivated by the definition of the Kalman variety in Ottaviani and Sturmfels [H] . 
In general real Kalman varities are difficult to characterize, however the one-dimensional 
positive variety admits a simple description. 



Corollary 4. For each fixed pair (w,X), let \I/ Wi a : /C — > JC be the map [X^] y [Yi 



r 3 Jl Then 



i. ii 



Xwi 

n 

tt MiA (/C(w,A)) = 0(A n 



i=l 



where each (A n _i)i is the interior of the [n — 1)- dimensional simplex on the n • {i — 1) + 1 
to n ■ i coordinate ofM. nxn . 



HODGERANK IS THE LIMIT OF PERRON RANK 



5 



Proof of Theorem [B The first statement follows by |T9l Lemma 2.2] (which is a direct 
computation). By Corollary HI V^ _1 (l) = R • 0" =1 (A„_i)j. By taking log, one obtains 
the stated result for V^. _1 (0). The case for k = oo follows from the convergence of the 
real amoebas to the zero set of the tropical eigenvectors map. As k approaches 0, each 
component Si(k) is flattened, and a little calculation shows that the each component 
converges, up to a translation by a large constant times the (1, . . . , 1) vector, to its tangent 
plane at an = ... = di n , which has equation ^Jj=i a ij = 0- This proves the theorem. □ 
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